How far are vowel formants from computed vocal tract resonances?
Abstract
We compare numerically computed resonances of the human vocal tract with formants extracted from speech during vowel pronunciation. The geometry of the vocal tract was obtained by MRI from a male subject, and the corresponding speech was recorded simultaneously. The resonances are computed by solving the Helmholtz partial differential equation with the Finite Element Method (FEM). Despite a rudimentary exterior space acoustics model, i.e., the Dirichlet boundary condition at the mouth opening, the computed resonance structure differs from the measured formant structure by ≈ 0.7 semitones for [i] and [u], which have a small mouth opening area, and by ≈ 3 semitones for the vowels [a] and [ae:], which have a larger mouth opening. The contribution of the possibly open velar port has not been taken into consideration at all, which adds discrepancy for [a] in the present data set. We conclude that by improving the exterior space model and properly treating the velar port opening, it is possible to computationally attain the four lowest vowel formants with an error of less than a semitone. The corresponding wave equation model on MRI-produced vocal tract geometries is expected to have comparable accuracy.
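As a rough illustration of the two ingredients named in the abstract (a Helmholtz eigenvalue problem solved by FEM with a Dirichlet condition at the mouth, and a discrepancy measured in semitones), the following minimal sketch treats a uniform one-dimensional tube. It is not the paper's method, which uses three-dimensional MRI geometries. The tube length (0.17 m), the sound speed (350 m/s), the closed (Neumann) glottis end, and the function names `semitones` and `tube_resonances` are all assumptions made for this example.

```python
import numpy as np


def semitones(f_a, f_b):
    """Absolute distance between two frequencies in semitones."""
    return 12.0 * np.abs(np.log2(np.asarray(f_a) / np.asarray(f_b)))


def tube_resonances(length_m=0.17, c=350.0, n_elements=200, n_modes=4):
    """Lowest resonances (Hz) of a uniform 1D tube via linear finite elements.

    Assumed model: -p'' = (omega/c)^2 p on [0, L], with p'(0) = 0 at the
    glottis (Neumann) and p(L) = 0 at the mouth (Dirichlet).
    """
    h = length_m / n_elements
    n_nodes = n_elements + 1
    K = np.zeros((n_nodes, n_nodes))  # stiffness matrix
    M = np.zeros((n_nodes, n_nodes))  # consistent mass matrix
    k_e = np.array([[1.0, -1.0], [-1.0, 1.0]]) / h
    m_e = np.array([[2.0, 1.0], [1.0, 2.0]]) * h / 6.0
    for e in range(n_elements):
        K[e:e + 2, e:e + 2] += k_e
        M[e:e + 2, e:e + 2] += m_e

    # Dirichlet condition at the mouth: drop the last node.
    K, M = K[:-1, :-1], M[:-1, :-1]

    # Reduce K u = lam M u to a symmetric standard eigenproblem.
    L_inv = np.linalg.inv(np.linalg.cholesky(M))
    lam = np.linalg.eigvalsh(L_inv @ K @ L_inv.T)[:n_modes]
    return c * np.sqrt(lam) / (2.0 * np.pi)


if __name__ == "__main__":
    fem = tube_resonances()
    # Quarter-wavelength resonances of the same idealized tube.
    exact = (2 * np.arange(1, 5) - 1) * 350.0 / (4 * 0.17)
    for f_fem, f_exact in zip(fem, exact):
        print(f"{f_fem:8.1f} Hz vs {f_exact:8.1f} Hz: "
              f"{semitones(f_fem, f_exact):.4f} semitones")
```

For this idealized tube the FEM values can be checked against the quarter-wavelength formula; in the setting of the abstract, the same semitone measure would instead be applied to computed resonances and measured formants.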
Similar resources
Vowel formants compared with resonances of the vocal tract
We compare numerically computed resonances of the human vocal tract with formants that have been extracted from speech during vowel pronunciation. The geometry of the vocal tract has been obtained by MRI, and the corresponding speech has been recorded simultaneously. The resonances are computed by solving the Helmholtz partial differential equation with the Finite Element Method (FEM).
3D Geometry of the Vocal Tract and Inter-speaker Variability
Three speakers of French (two males and one female) were the subjects of an MRI analysis of the vocal tract during the production of sustained isolated French vowels. From a 3D reconstruction of the vocal tract, area functions were determined for each vowel, and the corresponding formant values were computed with a harmonic model of the vocal tract. Using the computation of the sensitivities of...
Vocal-Tract Resonances as Indexical Cues in Rhesus Monkeys
Vocal-tract resonances (or formants) are acoustic signatures in the voice and are related to the shape and length of the vocal tract. Formants play an important role in human communication, helping us not only to distinguish several different speech sounds [1], but also to extract important information related to the physical characteristics of the speaker, so-called indexical cues. How did for...
The Role of Lower Airway Resonances in Defining Vowel Feature Contrasts by Steven
Since the voicing source is located between the lower and upper airways and has a high impedance, the resonances of the lower airway appear as pole-zero pairs in vowel spectra. These pole-zero pairs interact non-linearly with the vocal tract formants, producing narrow frequency bands within which formant structure is unstable. The broader frequency bands between lower airway resonances are thus...
Linking loudness increases in normal and Lombard speech to decreasing vowel formant separation
The increased vocal effort associated with the Lombard reflex produces speech that is perceived as louder and judged to be more intelligible in noise than normal speech. Previous work illustrates that, on average, Lombard increases in loudness result from boosting spectral energy in a frequency band spanning the range of formants F1-F3, particularly for voiced speech. Observing additionally tha...
Journal: CoRR
Volume: abs/1208.5963
Publication date: 2012